Shewanella decolorationis is a valuable microorganism for degrading
diverse synthetic textile dyes. Here, we present an annotated draft
genome sequence of S. decolorationis S12, which contains 4,219
protein-coding genes and 86 structural RNAs. This genetic information
can greatly advance our understanding of the physiology of this species.

The genus Shewanella is well known for bioremediation of environments
contaminated with toxic organic and inorganic chemicals and has been
found in water, sediment, and on surfaces of shale and sandstone (1).
Recent studies have demonstrated that Shewanella decolorationis
effectively degrades diverse manufactured textile dyes, including azo
and anthraquinone dyes, which are carcinogenic to humans (2–4).
S. decolorationis S12 was isolated in 2002 from activated sludge of a
textile-printing wastewater treatment plant in Guangzhou, China (3).
No whole-genome sequence of S. decolorationis has been available, which
has limited genetic study of this useful species.

Genomic DNA extracted from S. decolorationis S12 was sequenced on an
Illumina MiSeq sequencer according to Illumina's recommendations. A
total of 16,692,489 paired-end 250-base reads were generated. The draft
genome was assembled using Velvet (5) and Newbler; after manual
curation, it contains 77 contigs of >250 bp each, with a total of
4,843,449 bases, an N50 of 155.5 kbp, and a G+C content of 47.1%. The
draft genome sequence of strain S12 was annotated using Blast2GO (6)
and the NCBI Prokaryotic Genome Automatic Annotation Pipeline (PGAAP)
(7, 8).
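
The assembly statistics quoted above (N50 and G+C content) can be recomputed from any set of contigs. The following is a minimal sketch, not the procedure used for strain S12: the function names `n50` and `gc_content` are hypothetical, the definition of N50 is the common one (the contig length at which the cumulative length of contigs, sorted longest first, reaches half of the assembly), and ambiguous bases such as N are assumed to be excluded from the G+C denominator.

```python
def n50(lengths):
    """Return the N50 of a list of contig lengths.

    Contigs are taken longest first; the N50 is the length of the contig
    at which the running total reaches at least half of the assembly size.
    Returns 0 for an empty list.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0


def gc_content(sequences):
    """Return the G+C percentage over all unambiguous (A/C/G/T) bases."""
    gc = at = 0
    for seq in sequences:
        s = seq.upper()
        gc += s.count("G") + s.count("C")
        at += s.count("A") + s.count("T")
    # Assumption: bases other than A/C/G/T (e.g. N) are ignored.
    return 100.0 * gc / (gc + at) if (gc + at) else 0.0


# Toy values invented for illustration; these are not the S12 contigs.
toy_lengths = [100, 200, 300, 400]
toy_contigs = ["GGCC", "ATAT"]
print(n50(toy_lengths), gc_content(toy_contigs))
```

Applied to the 77 contigs of the deposited assembly, functions of this kind would be expected to reproduce the reported values only if the same contig length cutoff (>250 bp) and the same treatment of ambiguous bases are used.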

The draft genome sequence contains 4,305 genes, including 4,219
predicted coding sequences (CDS), 9 rRNA genes, and 77 tRNA genes. A
total of 3,184 CDS were classified using the Clusters of Orthologous
Groups (COG) database (9). The most abundant groups of proteins were
those involved in signal transduction (COG category T [9.5%]), amino
acid metabolism and transport (COG category E [9.0%]), transcription
(COG category K [8.5%]), and energy production and conversion (COG
category C [7.8%]). The genome sequence of S. decolorationis S12 serves
as a basis for further investigation of the molecular basis of its
potential in dye degradation.
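
Category percentages such as those listed above are simple shares of the classified CDS. As a hedged illustration only, the sketch below tallies one COG category letter per CDS and converts the counts to percentages. The function name `cog_shares` and the toy input are hypothetical, and it is an assumption that each classified CDS contributes exactly one category letter; the denominator used in the original analysis is not stated here.

```python
from collections import Counter


def cog_shares(assignments):
    """Map each COG category letter to its percentage of all assignments.

    `assignments` holds one category letter per classified CDS
    (an assumption made for this sketch). Categories are returned
    from most to least frequent.
    """
    counts = Counter(assignments)
    total = sum(counts.values())
    return {cat: 100.0 * n / total for cat, n in counts.most_common()}


# Toy assignments invented for illustration; not the S12 annotation.
toy = ["T", "T", "E", "K"]
for category, share in cog_shares(toy).items():
    print(f"COG category {category}: {share:.1f}%")
```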

Currently, the genus Shewanella contains 45 type species, and whole
genomes have been sequenced for 28 strains from 15 of these species.
Unlike these species from natural environments, S. decolorationis S12
was isolated from a human-altered and highly contaminated system, and
thus its genetic evolution and cellular responses might differ.
Comparative genomic analysis showed that S. decolorationis S12 was
closest to Shewanella sp. strain ANA-3 (89% coding sequence
similarity), followed by Shewanella sp. strain MR-7 (88%) and
Shewanella sp. strain MR-4 (88%). Further analysis will provide a
better understanding of the S. decolorationis genome and shed more
light on the genomic evolution of the genus Shewanella.

Nucleotide sequence accession numbers. This whole-genome shotgun
project has been deposited at DDBJ/EMBL/GenBank under accession number
AXZL00000000. The version described in this paper is the first version,
AXZL01000000.